Description of NTU Approach to NTCIR3 Multilingual Information Retrieval
نویسندگان
چکیده
This paper deals with Chinese, English and Japanese multilingual information retrieval. Several merging strategies, including raw-score merging, round-robin merging, normalized-score merging, and normalizedby-top-k merging, were investigated. Experimental results show that centralized approach is better than distributed approach. In distributed approach, normalized-by-top-k with consideration of translation penalty outperforms the other merging strategies.
منابع مشابه
ISCAS at NTCIR-3: Monolingual, Bilingual and MultiLingual IR Tasks
This paper reports the methods and procedures we took in the CLIR track of NTCIR3. In monolingual subtasks we mainly describe index representation and word segmentation method. We use hybrid model integrating MT approach and dictionary-based approach in bilingual and multilingual subtasks. The method for combining the monolingual retrieval results to produce the final rank list is explained in ...
متن کاملMerging Mechanisms in Multilingual Information Retrieval
National Taiwan University (NTU) Natural Language Processing Laboratory (NLPL) participated in MLIR task in CLEF 2002. We submitted five official multilingual runs. In this paper, we try to resolve the collection fusion problem. We experimented with several merging strategies that merge the results of several intermediate runs.
متن کاملPOSNIR: Probabilistic Natural Language Information Retrieval System
This paper describes our information retrieval system participated in CLIR task of NTCIR3 and reports the results with some observations. Most previous information retrieval models assume that terms of queries and documents are statistically independent from each another. However, independence assumption is obviously and openly understood to be wrong. We presented two method of incorporating th...
متن کاملMIRACLE's 2005 Approach to Cross-lingual Information Retrieval
This paper presents the 2005 Miracle’s team approach to Bilingual and Multilingual Information Retrieval. In the multilingual track, we have concentrated our work on the merging process of the results of monolingual runs to get the multilingual overall result, relying on available translations. In the bilingual and multilingual tracks, we have used available translation resources, and in some c...
متن کامل